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.PECUS Program Library Writwitp DECUS NO. S-S.,7 

CIUSI'H Cluster Analysis Program 

inis program is based on hierarchical grouping, as 

aesc^i bed in fortran for the Behavioral Sciences . 

by Donald 4- Veldman, and Is adapted from the program 
provided in that book. 

The program works as follows* Fi-r<=+- 

xuxiows. J?irst all data are read in 

° fc0red ^ 3 ^ dlmenS10na l arr ay on magnetic tape. If optioned, 

S00res are •***““«•* *0 —ore form. a S o, an option is 
provided for transposing the dat'a Matrix- to permit grouping of 

variables instead of subjects. Kext, all profiles are converted 

mto a matrix of error potentials for all pairings of these 

profiles. The next section n-e +.v 

section of the program locates the combination 

0bJe0tS . lnVOllrinS the !•«»* Increase in error. Each time 
rough this part of the program, one group (of subjects or 

variables) is combined with one other group, until all have been 
absorbed into 1 of 2 groups. 

Please note that each fimo fi,„ 

eacn time, those groups are combined 

which show the most similar nrnt'no u <. 

miiar proule, between which there exists 

the least distance. Therefore vom* kt / 

re, variables (or subjects) which 

show a high negative correlation will nnt h 

on W111 not grouped together. 

MI-NIfeUM HARDWARE 

PDP-8/1 wi th 8 K of cora wi 4-v, 4 -—t ±. 

core, with teletype, printer, .and 1 

magnetic tape drive or disk. 

OTHER PROGRAMS REEDED 

PS/8 Operating System with Fortran Compiler 




















mOi'Es 
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Intermediate date are stored either on disk or tape. The 
actual device is ASSIGNed at execution time. 

This program consists of 4 chained segments, named C1USTR, 
IRAI'ISP, MATRX1, and MATRX2, respectively .Ea.ch must be 
compiled, loaded, and saved separately on the system device. 
Specify 1 and 0 options to the loader for each. 

To allow ro execution-time-formatting of input data, load 
GLUSTR into core and change the contents of location 14671 ^ 

from 472b to 0400, before saving on the system device. For 
an explanation of execution-time format statements, see the 
accompanying write-up. 
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now to run C1USTR Cluster Analysis Program: 

1^ .ASSIGN (device) IN 

dJ * R C1USIR (Starting Address is 14161) 

5) Mount tape on unit 1 if it is device IN. It must contain 
the input lile, if input is from tape. Oherwise, tape with 
zeroed directory is preferable. 

4) iype in number of variables. Maximum is 80 if variables 

are to be clustered;' otherwise., maximum is 200 . 

■* . v . ”*■ * 

5) Iype in number of subjects. Maximum is 80 if subjects 
are to be clustered; otherwise, maximum is 200. 

6) Iype in ISTAN, the option for standardization of data. 

(l means yes, 0 means no) 

7) Type in ITRAN, transposition option. If variables are to 
be clustered, type 1. Otherwise, type 0. 

8) Type in a format statement. It may be 72 characters in 
length. Input is read in F format (no identifying 
information is read in). 

9) Type in a heading to appear on output. It may be 72 
characters in length. 

10) Type input device (1,2,3, or 4). If input is magnetic 
tape, the file must reside on unit 1 with the file name 
’CLUSIN.DA’. 


Note: She ’Read’ statement is at location 14664. If the card 
reader fails, STOP, load Address 14664, and press START. Cards 
should continue to read in. 
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Execution-time format Statements 


ExDcution-time format statements are not permitted by 
the compiler. However, by changing one location in the program 
after it is in core, it is possible to read in a format 
statement (from the teletype only), and use it for reading the 
input. Set up the program (or subprogram) as follows: 

DIMENSION ElvlT (12) 

C This is the array where-the. format statement will 

C be stored. 

READ (1,15) FMT 
15 FORMAT (12A6) 

C A format statement as long as 72 characters can 

C be read in here. 


20 READ (IN,25) DATAl,DATA2 

C Input data is read in from device IN 

25 FORMAT (F3-0) 

C This is a dummy format that will not be used. 


END 
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be sure to obiair an assembly listing of the program, 
and also a map at execution time, so that the precise location 
to be changed can be determined. 

.oad the program and all subroutines, and begin execution, 
tfhen the program pauses or waits to accept data, the appropriate 
location can be changed# 

Now look at the assembly listing. Determine the actual 
address of the first word of array PMT. (see page C-4 of 8K 
SABR Assembler manual or PS/8 manual for computing actual 
addresses.) Now look through the listing for the Fortran statement 
where the reading of the data is done, in this case, statement 
20. You will notice that in the assembled code will appear 

(call to subroutine READ 
with 2 arguments) 

(n is the device number) 

(DDDD is the address where 
the device number is stored.) 
(fa is the format number, in 
this case, 25) 

(PPPP is the address of the 
beginning of the format 
statement.) 

In our example, PPPP is the address of the first word 
of format statement 25. This is the location to be changed, 
so that it contains the address of the first word of array 
PiviT. Therefore, compute the address where PPPP appears and change 


tne iollowing: 


4033 CALL 2, READ 

6201 ARG (n 
DDDD 

6201 ARG /fa 

PPPP 
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its contents accordingly. (As a check on your computations, 
first compute the actual value FFFF,then load the address 
where -FFFF appears, and see if the contents agree with your 
compu 0 J .011 of FFFF. Then reload the address and deposit the 
address for the first word of FMT.) 

Once this one location has been changed, save the core 
image on tape. 


Altering Format Statements Already in Core 


Format statements are stored 2 characters per word in 
packed, 6-bit ASCII code. l*hus, the first half of the first 
word of a format statement is always a 50, the stripped ASCII 
code for open parenthesis. F2.0 takes 2 locations in storage, 
and appears as 0662 for F2, and 5660 for .0. Therefore, one can 
change all or any part of a format statement in core by 
determining its exact location and substituting the codes for 
the desired format. For any format statement that is likely to 
be changed in this manner, it is advisable to leave plenty of 
extra spaces when writing the program. 



# 
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